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Abstract —In a centralized or cloud radio access network, 
certain portions of the digital baseband processing of a group 
of several radio access points are processed at a central data 
center. Centralization improves the flexibility, scalability, and 
utilization of computational assets. However, the performance 
depends critically on how the limited data processing resources 
are allocated to serve the needs of the different wireless devices. 
As the processing load imposed by each device depends on its al¬ 
located transmission rate and channel quality, the rate-allocation 
aspect of the scheduling should take into account the available 
computing resources. In this paper, two computationally aware 
schedulers are proposed that have the objective of maximizing the 
system sum-rate while satisfying a constraint on the offered com¬ 
putational load. The first scheduler optimally allocates resources 
and is implemented according to a water-filling algorithm. The 
second scheduler is suboptimal, but uses a simpler and intuitive 
complexity-cut-off approach. The performance of both schedulers 
is evaluated using an LTE-compliant system level simulator. It 
is found that both schedulers avoid outages that are caused by 
an overflow of computational load (i.e., computational outages) 
at the cost of a slight loss of sum-rate. 

I. Introduction 

The 5* generation of mobile communication networks will 
be accompanied by a paradigm shift towards virtualization 
and “cloudification”. This trend is already underway: Network 
Eunction Virtualization (NFV) is being developed (see, for 
example, use case #6 in fl]), while requirements studies on 
5G consider network and service flexibility as one of the key 
requirements 121. One technology that offers this flexibility 
in the radio access network (RAN) is Cloud-RAN, which 
centralizes part of the radio network functions in order to 
exploit centralization and coordination gains 13 . 

In a Cloud-RAN network, the radio protocol stack is ex¬ 
ecuted by different physical entities. Eor a given group of 
cells, the lower parts of the protocol stack are executed at 
the radio access points (RAPs), while the upper parts are 
executed at a central entity, called the Cloud-RAN platform. 
The RAPs use dedicated hardware, while the Cloud-RAN may 
be implemented with commodity general-purpose hardware. 
Fig. [T] illustrates the high-level architecture of a Cloud-RAN 
system. The physical RAPs are connected via a backhaul 
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Fig. 1. An exemplary Cloud-RAN architecture. 


network (often also referred to as fronthaul) to the Cloud- 
RAN platform, which executes the upper parts of the protocol 
stack (denoted as virtualized RAP functions). The virtual 
infrastructure provides on-demand computing resources, e.g. 
in form of processors, to the virtual RAPs a, 13. For this 
paper, it is important to note that the forward error correction 
(EEC), which is amongst the most computationally intensive 
RAN functions, is executed on the Cloud-RAN platform. 

The Cloud-RAN platform may dynamically assign data 
processing resources to each RAP based on its processing 
demand. This implies the need for a mechanism which es¬ 
timates the computational load of the aggregated cells and 
assures that the computational resources are sufficient. In 
0, Zhu et al. demonstrate the general feasibility of this 
approach with a WiMAX base station implemented on general 
purpose processors (GPPs). In Jb), Bhaumik et al. provide a 
numerical analysis of the required data processing resources to 
operate a 3GPP LTE RAN protocol stack. In Q, Werthmann 
et al. present a method based on admission control to manage 
the required data processing resources of a fully centralized 
RAN. In il, il, we investigate the trade-off between invested 
data processing resources and achievable data rates, taking 
into account specifically the processing requirements of the 
EEC on the uplink. Using this framework, the system can be 
dimensioned according to the probability that the system has 
insufficient data processing resources to process all incoming 
transmissions, a condition that is called computational outage. 



















In order to avoid a scenario where uplink transmissions 
are dropped due to computational outage, this paper proposes 
two resource-allocation strategies, which allocate resources to 
uplink users under a computational-complexity constraint with 
the goal of maximizing the overall throughput of the network. 
By resource allocation we refer to the allocation of rate to each 

user. Due to the correspondence between rate and computa¬ 
tional load 0 , ii, the rate allocation problem is equivalent 
to allocating computational resources. The formulation leads 
to a water-filling approach to the allocation. 

The paper is organized as follows. Section |II] introduces 
the system model and computational complexity framework. 
Section m derives the resource allocation strategies. Sec¬ 
tion |IV] describes the system-level simulator and provides a 
comparison of two scheduling strategies and a benchmark 
algorithm. The paper concludes in Section |V] 

II. System Model 

Consider a system with Nc RAPs, which are centrally 
processed at the Cloud-RAN platform. For the sake of a 
simplified notation, assume that each RAP serves exactly 
one user equipment (UE) over the complete bandwidth. This, 
however, does not limit the applicability of the algorithms 
as they operate on a per-user level and therefore can easily 
be extended to the case of mutiple users per RAP. The 
channel gain between RAP and UE remains fixed during each 
scheduled transmission (e.g., each subframe), and the channel 
is subject to additive white Gaussian noise. The instantaneous 
signal-to-interference-and-noise ratio (SINR) between RAP k 
and its UE for a transmission is given by 7 ^. The RAP chooses 
a data rate < log(l + 7 fc) (in units of bits per channel use 
(bpcu)) from a discrete set of Nn available modulation-and- 
coding schemes (MCSs). Let TZi = {ri ,.. be the set of 

rates allocated to each uplink for the resource allocation 
strategy. Then, the set TZ* = {TZi,.. .TZna) with Na < 
denotes the set of all feasible rate allocations. 

In 0, 0, the concept of computational outage and the 
trade-off between computational complexity and data rate was 
discussed using the model introduced by Grover et al. in 
Eo). Eor turbo-decoding (as e.g. used in LTE), the number of 
turbo-iterations required to successfully decode a codeword 
for a particular SINR 7 ^ depends strongly on the selected 
data rate r^. If is chosen close to the channel capacity 
log(l -f 7 fc), on average a large number of iterations will be 
required. However, as the allocated decreases for the given 
SINR, the number of required iterations also decreases. The 
overall computational complexity to process one codeword 
scales with the number of information bits that are processed 
and the number of iterations. The computational complexity 
can be computed as the product of the information bits and 
turbo-iterations required, divided by the number of channel 

uses, i. e. bit-iterations per channel use (pcu). 

This relationship is formalized by m Eq. (3)] 

= l08jc-l)[‘°®“ (uS?) 

( 1 ) 


where C is a parameter of the model related to the connectivity 
of the decoder when represented as a graph. 
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l0g2[l0g2(l+7fc)-Tfe], (3) 


K' is a parameter of the model, and Cchannei is a constraint on 
the channel outage probability. The set of parameters {K', 
is selected by calibrating O with an actual turbo-decoder 
implementation or message-passing decoder. In the remainder 
of this paper, we use the abbreviation Ck = C( 7 fc, r^). 


III. Resource Management in Cloud-RAN 

This section begins in Sec. IIII-AI with the derivation of 
an optimal water-filling scheduling policy when the rates are 
drawn from a continuum of possible rates. Next, in Sec. IIII-BI 
the optimization is modified to account for the more practical 
case of having an MCS scheme with only finite number of 
possible rates to pick from. Einally, in Sec. lIII-Cl an alternative 
low-complexity resource-allocation strategy is proposed that is 
based on an intuitive complexity cut-off approach. 


A. Max-Rate Optimization Problem 

Suppose that the Cloud-RAN platform supports a maximum 
computational complexity Cserver, i- e., — ^server must 

hold. Eurther, assume that the set of allocated rates may be 
drawn from a continuum of possibilities. Our objective is 
to maximize the sum-rate of the system while avoiding any 
computational outage, which can be formalized as follows; 

TZopi = argmax r^, (4) 

S.t. ^ ^ Ck Cseiven 
r^-GTZ 

where TZ' is the set of all rate allocations satisfying 0 < < 

log(l -I- 7 fe) for all k. 

This scheduling problem implies a trade-off between the 
invested computational complexity and the gained achievable 
rates. In order to solve the partial derivative dCk/drk 
is required, which determines how much the computational 
complexity varies when the data rate is changed. However, 
the derivative of Ck is rather complex and difficult to apply in 
this optimization problem. Therefore, we use the following 
piecewise linearization of l{yk,rk), which does not alter 
significantly the accuracy of the analysis: 

l{yk,rk) « akrk+hk, ( 5 ) 

where 

_ dl{yk,rk) _ _-1_ 

drk log( 2 )[log 2 (l + 7 fe)-rfe]’ 

h = log2 [log2(l + 7fe) - Tfc] - ttferfc. 

Using 0, we can rewrite 0 as 

Ck ~ akrl-\-Pkrk, 


(6) 
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Theorem 1: The solution to the rate allocation problem in 
(|4|i can be well approximated by 

where 0 < 77 < l/Pk and < Cserver- 

rk&Tl 

Proof: The proof follows similar arguments and method¬ 
ology of the power allocation problem im, which leads to the 
well known water-filling method. The details of the derivation 
are provided in Appendix ■ 

In (|7]i the parameter I /77 determines the water-level, which 
decides whether a UE is served or not, while Pk is the cost (in 
terms of complexity) of transmitting. If the difference between 
the selected rate and the capacity log(l -|- 7 /c) is small, then 
the term Pk becomes very large (due to the large slope of 
Ofc). Hence, the UE is unlikely to be served. The parameter 
Uk scales the rate, i. e. if again the rate operates close 
to capacity, ak will be also very large and therefore scales 
down the assigned data rate in order to reduce the necessary 
computational complexity. 


B. Application to Limited Number of Rates and Multiple Users 

The solution of (|4]i is not practical in the more realistic case 
that the allocated rate must be drawn from among a finite set 
of discrete MCS levels. Assume now that the allocation is over 
the set TZ* — {T^i,.. ■TZna} of feasible discrete allocations. 
The discretized allocation may be stated as: 


Tk 
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where (a) follows from Theorem [T] assuming that 0 < 77 < 
l//3fc and ( 6 ) results from (| 6 ll, assuming there is no constraint 
on the computational complexity. Note that in (|9]l and in the 
following equations, Ck is evaluated by ([ 1 ]). 

Combining ([ 8 ]l and (|9]), we can further state that 


1 

- > 

V 



( 10 ) 


— ‘iO-kCk + Pk, 


( 11 ) 


which gives us the required water-level for each user. 
Equations (fTTT l and (ISTT l (77 > 1/pk => fk = 0 ) yield 

^4.akCk+Pl >pk^rk = Q, ( 12 ) 

which is a condition that makes sure that the computational 
resources released are invested to those users, which can be 


served with higher assignable rates. 

Another drawback of © is that it does not assign the 
computational resources in a fair manner. In particular, the 
aforementioned solution always favors those users that have 
a higher SINR and it allows them to transmit at the highest 
rates, as can be seen from function while the other users 
might be dropped. 

In order to schedule the rates more fairly from the set of 
allowed values, the following iterative procedure can be used: 

1) Initialization: 

a) Set TZ such that each user k receives the maximum 
possible rate r^. 

b) Set rk = 0 for all users for which (fl^ is satisfied. 

2) Recursion: If Ck > Cserver then 

rk&TZ 

a) Compute (fTTT l for each user and select k* user with 
the highest value: k* = argmax^ ^AaCk + Pi- 

b) Decrease the rate for user k to the next lower MCS 
and update TZ accordingly. 

3) Decision: 

a) Halt the process if Ck < Cserver- Otherwise, 

I—I 

go back to step 

4) Attempt to serve dropped users: 

a) Eind the users for which = 0 and among them 
find k* such that k* = argmax^ y/4aCk + P'k- 

b) Assign to the user k* the rate provided in La and 
update TZ accordingly. 

c) Halt the process if ^ Ck > Cserver, otherwise go 

rk&Ti 

back to Hi] If Ck > Cserver Set Tfc. = 0, and 
Tk&n 

update TZ accordingly. 

In the previously described method, step 1-3 iteratively re¬ 
duce the MCS for the user with the highest value of (fTTI) . until 
the complexity constraint is satisfied. Step |4] attempts to assign 
eventual remaining computational resources to those users that 
were dropped even though their MCS were sufficiently high. 
In the following, we refer to this process as scheduling with 
water-filling (SWF). 

C. Complexity Cut-Off 

In this section, we introduce a slightly simpler and intuitive 
scheduling method. Instead of applying (|7]i, which requires 
the linearization described in dS in order to determine the 
parameters ak and Pk, we select the users with the highest 
complexity and reduce their MCS (and associated rate rk) 
until the sum-complexity constraint Cserver is fulfilled. In this 
case an iterative method, similar to one described in Sec. lIII-Bl 
can be used, which works as follows: 

1) Initialization: Set TZ such that each user k receives the 
maximum possible rate rk, 

2) Recursion: If Ck > Cserver then 

rk&TZ 

a) Select the user k* with the highest complexity: 
k* = argmaxfc Ck 















b) Decrease the rate for user k to the next lower MCS 
and update TZ accordingly 

3) Decision: Halt the process if Ck < Cserver- Other- 
wise, go back to step 2 . 

As the result of this procedure, we obtain a rate allocation 
TZ S TZ*, which satisfies the complexity constraint and 
always reduces the rate of those users that require the higher 
complexity. The main difference to the previously discussed 
allocation algorithm is that this is not necessarily sum-rate 
optimal. However, it is still very efficient as we will discuss 
in the next section. In the following, we refer to this method 
as scheduling with complexity cut-ojf (SCC). 

IV. System-level Evaluation 

In this section, we evaluate the performance of the proposed 
resource allocation strategies in terms of complexity and sum- 
rate. The performance of SWF, provided in Sec. IIII-BI is 
compared against SCC, described in Sec. IIII-CI In addition, 
the schedulers are compared against the benchmark maximum 
rate scheduling (MRS) policy 0 , which is the scheduler that 
simply sets each to its maximum value by selecting the 
maximimum rate that satisfies the outage constraint after Tmax 
decoder iterations for the given SINR 7 ^. The comparisons are 
made by using a system-level simulator that is compliant with 
the 3GPP LTE standard. 

A. System-Level Simulator 

We assume a 3GPP LTE system using adaptive modulation 
and coding based on turbo codes with overall 27 distinct MCSs 
{Nn = 27). The rate of the MCS is given by ||9l 

T* = log2 ^1 + , (13) 

where 7 /^ indicates the minimum SINR for which the 
MCS satisfies on average an outage constraint after the Lmax-th 
iteration, while 1 / is a parameter that models the gap between 
the capacity at 7 /* and the SINR for the actual code to meet 
the performance objective at rate r^. In the following, it is 
assumed that Lmax = 8 . The value of yf- for each MCS can 
be obtained as follows. Simulations are used to obtain transport 
block error rate (TBLER) curves for each possible MCS by 
setting an upper bound on the maximum number of turbo¬ 
iterations. For the MCS, 7 /^ is selected to be the value of 
SINR for which the TBLER satisfies a particular constraint 
for the channel outage fchannei- 

The parameter u together with the complexity model pa¬ 
rameters {K'X} are selected by statistically fitting the model 
with an actual LTE turbo-decoder. In particular, from 19) the 
best fit is given for K' — 0.2, C = 6 ^ and v = 0.2 dB. 

We consider a network composed of A"bs = 129 base 
stations (BSs) shown in Fig. |2] which is a segment of the 
actual deployment by a major provider in the UK at 1800 
MHz over a square arena of 30 x 30 km^. Assume that the 
Cloud-RAN platform centrally processes the uplink signals 
from the = 10 cells highlighted in yellow and the UEs are 



Fig. 2. BS locations with centrally processed RAPs highlighted in yellow. 


TABLE I 

Main parameters for the system level evaluation. 


Spatial distribution of users 

PPP 

Density of UE per unit area 

A = 1 UEs/Km^ 

Path loss exponent 

a = 3.7 

Number of centrally processed RAPs 

Nc = 10 

Computational outage 

e = [10, 

Channel outage constraint 

^channel = 10 % 

Fading 

Rayleigh 

Fractional Power-Control Factor 

s=0.1 

Transmit power 

Po = 10 W 

Noise power 

W = 100 mW 

Simulation trials 

A^trials = 10 


distributed according to a Poisson point process (PPP) with 
intensity A users per km^. 

Let Yk indicate the RAP that serves a cell with area 
Ak and its location, and Xk indicate the UE served by the 
kth j(s location. We assume the path loss from a 

mobile Xk to a BS Yk is \Y — X\~°‘, where a is the path-loss 
exponent. When fractional power control is used, a mobile’s 
transmit power is Pk = Po\Yk — XkY°‘, where Pq is the 
reference power (which is assumed as the power received 
at unit distance from the transmitter) and s, 0 < s < 1 , is 
the compensation factor for fractional power control. In the 
following, we assume s = 0 . 1 , which is the value reported in 
112 that maximizes the sum throughput. 

The fading power gain from Xi to Yk is assumed to be 
exponentially distributed with unit mean, corresponding to 
Rayleigh fading. Furthermore, assume that the fading power 
gains remain hxed for the duration of a transmission, but vary 
from user to user and from one transport block (TB) to the 
next (block fading). 

The sum-rate and the sum-complexity of the system are 
used in the following as performance metrics to compare the 
proposed allocation strategies with the benchmark scheduler. 
Both performance metrics are evaluated through simulations 
for each of the allocation strategies under examination as 
follows. During each trial, a mobile is placed at random in the 
k*^ cell with probability 1—exp(—AAlfc). Once the mobiles are 
placed, the fading coefficients are drawn from an exponential 

























(a) CDF of the computational complexity. 



Sum-Rate (bit pcu] 

(b) CDF of the sum-rate. 

Fig. 3. CDFs of the complexity and of the sum-rate. 

distribution and the SINR at each of the Nc RAPs is computed. 
By applying a given allocation strategy, we hnd the selected 
MCS for each RAP and the corresponding rate based on the 
quality of the channel. Using O, the complexity required 
to process the uplink signal of each UE is evaluated. The 
sum-rate and sum-complexity are now computed by summing 
up respectively the rates and complexity for all Nc RAPs. 
Once the allocation algorithm is applied, if X]/c > Cserver, a 
computational outage occurs and the sum-rate is set to zero. 

B. Numerical Results 

In this subsection, the parameters summarized in Table U are 
used in the system-level simulator, if not otherwise stated. 

Fig. |3] shows the cumulative distribution function (CDF) of 
both the achieved sum-rate and the required computational 
complexity over A); = 10 RAPs. The figure shows the curves 
for both SWF and SCC, when the system is designed such that 
e = 10%,l%,0.1% computational outage holds (the notches 
in the magnihcation of Fig. |3^a) show the corresponding 
value of Cserver)- Each of these values translates into a peak- 
requirement for computing resources given the respective 
outage constraint. Fig. [3] shows as a benchmark the curve for 
the unconstrained case, which selects the maximum possible 
rate that can be used since there is no computational constraint. 
Fig. |3!a) shows that a stronger constraint on the available 
computational resources implies higher computational outage. 
More importantly, this figure highlights that while the required 





Fig. 4. Sum-rate as a function of Nc- 



Fig. 5. Sum-rate as a function of the user-density A. 


computational complexity is signihcantly reduced by dimen¬ 
sioning the system to an higher computational outage, the sum- 
rate only decreases slightly for both SWF and SCC, i. e. the 
average sum-rate only decreases by 0.28% for e = 10% 
and by k, 0.07% for e = 0.1 %, as illustrated by Fig. [3b). 

This shows the efficiency of the proposed schedulers, which 
impact the achievable sum-rate only marginally, while they 
reduce the required computational resources significantly. Fur¬ 
thermore, Fig. 13 shows that the introduced schedulers are able 
to completely avoid any computational outage, which would 
lead in the worst case to drop the connection of UEs. Even 
if one solution could be to dimension the system for a very 
low computational outage, i. e. e = 10“® %, the drawback 
is that the system will be significantly over-provisioned and 
most of the time the allocated resources are under-utilized. By 
contrast, our schedulers allow to avoid computational outage, 
while maintaining a high server utilization. 

Fig. 13 shows the sum-rate as a function of the number 
of RAPs, which are centrally processed at the Cloud-RAN 
platform. The performance hgures are shown for SWF and 
SCC as well as for MRS, which does not account for the 
computational constraint. Fig. |3 shows the impact of the 
computational outage targets on the different schedulers. As 
it can be noticed, for all values of Nc, the impact of the 
constraint on the computational resources is marginal for the 
computational aware schedulers. By contrast, as e increases, 
the impact on a system, which uses MRS, increases linearly 




















































with e due to the increasing computational outage. Further¬ 
more, the magnification in Fig. |4] shows that there is only a 
marginal difference between SWF and the SCC, emphasizing 
the fact that the less complex SCC algorithm achieves almost 
the same performance as SWF. 

In mobile networks, it may happen that the traffic demand 
differs significantly over time. In this case the system experi¬ 
ences traffic peaks, e.g. when many people leave or join the 
metro or an event. This may lead to cases where the system 
experiences a higher computational demand than usual. Fig. 
|5] shows the sum-rate of SWF, SCC and MRS as function of 
the density of UEs per km^ when the maximum computational 
resources available lead to a computational outage of 10 % and 
0.1 % in the case of A = 0.5 UEs/km^. Eig.|5]shows the ability 
of our proposed schedulers to provide services to all the users 
while slightly reducing the system-throughput. Eurthermore, it 
is shown that the proposed scheduling algorithms are able to 
accommodate the increasing traffic demand, while the MRS 
suffers gradually from the higher computational outage. 


Using the Karush-Kuhn-Tucker conditions, it follows that 
dL 

V/c : — = 0 1 -p = p {2ark + Pk) (15) 

OTk 

Ck< Cserver (16) 

Vfc:rfe>O^V0fc>O:0fcrfc = O. (17) 

Eirst, lets assume that 7 ^ 0 0^ = 0, which follows 

from ([TtT i. Using (fTsT i and setting 0 ^ = 0 , it yields 

1 = Tj {2akrk + Pk) ■ (18) 

Erom ( fTSl l using ( fTfil l. it follows that (with p > 0 ) 

rk = tr—■ (19) 

2ak \r] ) 

Lets consider the case when 0^ 7 ^ 0 = 0, which 

again follows from ( fTTI i. Using (flSl l and rk = 0, it yields 

1 + ©fe = TlPk- (20) 


V. Conclusions 


The computational complexity of RAN functions is one 
of the main obstacles for the introduction of cloud com¬ 
puting principles into the mobile network radio access. In 
this paper, we have developed a framework, which solves 
the user resource allocation problem under the assumption 
of limited computational resources in a centralized cloud 
platform. We showed that the underlying optimization problem 
can be solved with an adapted water-filling approach, making 
it feasible to fulfill the strict timing requirements of the wire¬ 
less access (e.g., several milliseconds in LTE). Eurthermore, 
we have shown that an intuitive complexity-cut-off approach 
delivers near-to-optimal results as well. Einally, the numerical 
evaluation confirms that meeting computational complexity 
constraints does not lead to significant penalties in terms of 
throughput, a fact which underlines the applicability of the 
approach in practical systems. 

Appendix A 
Proof of Theorem[T] 


Proof: This section provides details leading to the solu¬ 
tion of the optimization problem given by (|4]i. Since both (| 6 ]l 
and the constrained functions in (|4]l have continuous first par¬ 
tial derivatives, this problem can be solved through the method 
of Lagrange multipliers. Given the Lagrange multipliers p and 
© = { 01 ,..., 0Ar,,}, the Lagrangian can be written as follows 


L(7^,??,©) 



-tr[©diag(rfc)]. 


The partial derivative of the Lagrangian over rk is 


dL 

drk 


-1-f 77--0 

ark 


-I+ 7] {2akrk + Pk) - 0 . 


Erom (I 20 I 1 and since in this case 0^ > 0 , it follows that 

’'"S' ™ 

By combining (fT^ and d 2 Tli . Theorem [T] is obtained. ■ 
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